Scenario 0: The pre/post correlation or raw data is reported.
This is the ideal scenario where the pre/post Pearson correlation is reported in the primary study or the raw data is available. If the correlation is not reported, but the raw data is available then we will have to calculate the pre/post correlation ourselves. This can be done easily in base R using the cor()
function. If neither the raw data or Pearson correlation is available, contact the authors of the primary study to obtain the raw data.
If the pre/post correlation is reported as a Spearman rank-order correlation instead of the Pearson correlation, we can use the Spearman rank-order correlation (\(r^R_{12}\)) to approximate the Pearson correlation assuming the underlying distribution is a bivariate normal (Rupinski and Dunlap 1996, eq. 2),
\[
r_{12} \approx 2\sin^{-1}\left(\frac{\pi \times r^{R}_{12}}{6}\right)
\qquad(9)\]
However, if the pre/post correlation is reported as a Kendall’s \(\tau\) coefficient then the correlation tends to be under-estimated. Therefore, we can use Kendall’s (1962) formula for converting to a Pearson correlation (assuming an underlying bivariate normal distribution),
\[
r_{12} \approx \sin^{-1}\left(\frac{\pi\times \tau_{12}}{2}\right)
\qquad(10)\]
Note that Equation 9 and Equation 10 are merely approximations and therefore it is recommended to move on to scenarios that allow for an exact calculation of the pre/post correlation before using these formulas.
Acknowledgments
The authors express our sincere gratitude for the invaluable insights provided by X. Through their generosity and expertise, this study has been significantly enhanced and safeguarded against numerous potential errors. Any remaining errors are solely our responsibility.
Cuijpers, P., E. Weitz, I. A. Cristea, and J. Twisk. 2017. “Pre-Post Effect Sizes Should Be Avoided in Meta-Analyses.” Epidemiology and Psychiatric Sciences 26 (4): 364–68. https://doi.org/10.1017/S2045796016000809.
Jané, Matthew B., Qinyu Xiao, Siu Kit Yeung, Mattan Ben Shachar, Aaron Caldwell, Denis Cousineau, Daniel Dunleavy, et al. 2024. Guide to Effect Sizes and Confidence Intervals. https://doi.org/10.17605/OSF.IO/D8C4G.
Kendall, Maurice George. 1962. Rank Correlation Methods. Griffin.
Luo, Dehui, Xiang Wan, Jiming Liu, and Tiejun Tong. 2018. “Optimally Estimating the Sample Mean from the Sample Size, Median, Mid-Range, and/or Mid-Quartile Range.” Statistical Methods in Medical Research 27 (6): 1785–805. https://doi.org/10.1177/0962280216669183.
Pearson, Karl, and L. N. G. Filon. 1897. “Mathematical Contributions to the Theory of Evolution. IV. On the Probable Errors of Frequency Constants and on the Influence of Random Selection on Variation and Correlation. [Abstract].” Proceedings of the Royal Society of London 62: 173–76. https://www.jstor.org/stable/115709.
Rupinski, Melvin T., and William P. Dunlap. 1996. “Approximating Pearson Product-Moment Correlations from Kendall’s Tau and Spearman’s Rho.” Educational and Psychological Measurement 56 (3): 419–29. https://doi.org/10.1177/0013164496056003004.
Shi, Jiandong, Dehui Luo, Hong Weng, Xian-Tao Zeng, Lu Lin, Haitao Chu, and Tiejun Tong. 2020. “Optimally Estimating the Sample Standard Deviation from the Five-Number Summary.” Research Synthesis Methods 11 (5): 641–54. https://doi.org/10.1002/jrsm.1429.
Viechtbauer, Wolfgang. 2010. “Conducting Meta-Analyses in R with the metafor Package.” Journal of Statistical Software 36 (3): 1–48. https://doi.org/10.18637/jss.v036.i03.
Wan, Xiang, Wenqian Wang, Jiming Liu, and Tiejun Tong. 2014. “Estimating the Sample Mean and Standard Deviation from the Sample Size, Median, Range and/or Interquartile Range.” BMC Medical Research Methodology 14 (1): 135. https://doi.org/10.1186/1471-2288-14-135.